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Obj ective : Integration of carcinogenic human papillomaviruses (HP Vs) into the host genome 
is a significant tumorigenic factor in specific cancers including cervical carcinoma. Although 
major strides have been made with respect to HPV diagnosis and prevention, identification and 
development of efficacious treatments for cervical cancer patients remains a goal and thus 
requires additional detailed characterization of both somatic events and HPV integration. Given 
this need, the goal of this study was to use the next generation sequencing to simultaneously 
evaluate somatic alterations and expression changes in a patient's cervical squamous carcinoma 
lesion metastatic to the lung and to detect and analyze HPV infection in the same sample. 
Materials and Methods: We performed tumor and normal exome, tumor and normal 
shallow whole-genome sequencing, and RNA sequencing of the patient's lung metastasis. 
Results: We generated over 1.2 billion mapped reads and identified 130 somatic point 
mutations and indels, 21 genie translocations, 16 coding regions demonstrating copy number 
changes, and over 36 genes demonstrating altered expression in the tumor (corrected P < 0.05). 
Sequencing also revealed the HPV type 1 8 (HPV- 18) integration in the metastasis. Using both 
DNA and RNA reads, we pinpointed 3 major events indicating HPV-18 integration into an 
intronic region of chromosome 6p25.1 in the patient's tumor and validated these events with 
Sanger sequencing. This integration site has not been reported for HPV-18. 
Conclusions: We demonstrate that DNA and RNA sequencing can be used to concurrently 
characterize somatic alterations and expression changes in a biopsy and delineate HPV 
integration at base resolution in cervical cancer. Further sequencing will allow us to better 
understand the molecular basis of cervical cancer pathogenesis. 
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Key events that drive cancer are influenced by a multitude of 
factors that still remain to be understood. One event is 
viral infection, which is estimated to have a role in 15% 
to 20% of cancers. 1 Cancers that have been found to be 
associated with viral infection include cervical, 1 ' 2 oropha- 
ryngeal and hepatocellular carcinomas, 1 ' 3 and some leuke- 
mias and lymphomas. 14-6 One well-known association is 
the role of human papillomavirus (HPV) in cervical carci- 
noma (CC). This relationship was discovered after isolation 
and identification of HPV- 16 and HPV- 18, now designated as 
high-risk mucosal HPV subtypes, in CC biopsies. 7 ' 8 Addi- 
tional studies show that infection by these and other high-risk 
HPV types is the primary risk factor for developing CC 2 ' 9 ' 10 
such that they are identified in at least 99% of CC cases. 2 ' 11 
In 2013, 12,340 new CC diagnoses, and 4030 CC- 
associated deaths are estimated. 12 Fortunately, early detec- 
tion and preventative measures have resulted in a decrease in 
the mortality rate. However, our understanding of HPV in- 
tegration in CC is still evolving as studies point to both the 
presence of HPV integration hotspots 13 ' 14 and the absence of 
a correlation between HPV type and integration location. 13 
Furthermore, despite the well-established association between 
HPV and CC, identification of effective therapies for patients 
with CC remains a challenge and emphasizes the need for 
additional characterization of CC tumors and further evalu- 
ation ofHPVinCC. 15 

Given this demand, we applied next generation sequen- 
cing (NGS) to characterize somatic alterations in a live metas- 
tatic cervical squamous cell carcinoma patient and to detect and 
analyze HPV integration in the same sample. We performed 
whole-genome, exome, and RNA sequencing (RNAseq) from 
DNA and RNA collected from the patient's tumor biopsy 
specimen and peripheral blood sample. We describe here the 
first reported study of combined DNA and RNA sequencing 
as well as HPV integration and analysis in a live patient with 
metastatic CC. 



MATERIALS AND METHODS 

Please refer to the Supplemental data for detailed methods, 
available at http://links.lww.com/IGC/A194. 

Ethics Statement 

The patient was treated on protocols approved by the 
institutional review board of the Mayo Clinic. This study was 
conducted in accordance with the 1996 Declaration of Helsinki. 
Written informed consent was obtained from the patient for se- 
quencing analyses and data release. 

Patient Clinical History 

The patient's condition was diagnosed with stage IB2 
squamous cell carcinoma of the cervix in June 2007. The patient 
underwent numerous treatments, and a metastatic lung biopsy 
was acquired in April 2012. This lung biopsy specimen was sent 
to the CRL (Clinical Reference Laboratory, Lenexa, KS) for 
nucleic acid isolation, and the DNA/RNA was sent to the 
Translational Genomics Research Institute for sequencing. 

2 



Sample Assessment 

The patient's lung biopsy specimen was preserved as 
fresh frozen and assessed as 100% tumor squamous carcinoma. 
Direct visualization of the sample was performed to estimate the 
tumor content and the extent of tissue heterogeneity by a board- 
certified pathologist. 

DNA and RNA Isolation 

Blood leukocytes were isolated from the whole blood 
and homogenized. Genomic DNA was purified using the 
Qiagen AllPrep DNA spin column (Valencia, C A) . Tumor DNA 
and RNA isolations were performed by CRL using Qiagen 's 
AllPrep Kit. 

Library Preparation, Sequencing, and Data 
Analysis 

Isolated DNA and RNA were used to generate whole- 
genome, exome, and RNA sequencing libraries. Libraries 
were paired-end sequenced on the Illumina HiSeq 2000 (San 
Diego, CA) and analyzed after Fastq generation and align- 
ment against the human reference genome (build 37) using 
the Burrows- Wheeler Alignment 16 for DNA sequencing data 
and Bowtie/Tophat 17 ' 18 for RNA sequencing data. The dbGaP 
(database of Genotypes and Phenotypes) accession number 
for sequencing data from this study is phs000628.vl.pl. 

Experimental Validation 

To validate HPV- 18 integration sites and the presence 
of episomal HPV- 18, primers were designed upstream and 
downstream of 4 separate junctions. These junctions include the 
following: (1) E6 to long control region (LCR) (episomal HPV- 
1 8), (2) E2/E4 to chr6:4,328,779, (3) E2 to chr6:4,282,640, and 
(4) E 1 to chr6:4,29 1 ,973 . Polymerase chain reaction (PCR) was 
performed using each primer set on cDNA that was previously 
generated from tumor RNA during library construction, and 
PCR products were Sanger sequenced. Quantitative PCR was 
performed by CRL to evaluate PIK3CA expression with Fj-actin 
as the control gene. 

RESULTS AND DISCUSSION 
Whole-Genome Sequencing 

We performed shallow whole-genome sequencing (WGS) 
from DNA isolated from the lung biopsy specimen and 
whole blood sample to identify somatic copy number changes 
and translocations. Whole-genome sequencing metrics are 
listed in Table 1 , and identified translocations and copy number 
changes are summarized in Figure 1. Overall, we identified 
21 translocations (Table 2) affecting at least 1 gene. Of these 
events, 1 affected a COSMIC (Catalogue of Somatic Mutations 
in Cancer) 19 gene, TRIM27 (tripartite motif containing 27; 
RET). Point mutations in TR1M27 have been identified in 
other cancers, 20 ' 21 but no somatic translocations in this gene 
have been previously reported in CC. We additionally identified 
16 genie regions demonstrating copy number variations (CNVs) 
encompassing 354 genes (Supplemental Digital Content Fig. SI, 
available at http://links.lww.com/lGC/A194). The 16 regions en- 
compass 1 1 COSMIC genes (Table 3) including PIK3CA (phos- 
phatidylinositol-4,5-bisphosphate 3-kinase, catalytic subunit a), 
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50X2 [SRY (sex-determining region Y)-box 2], 7L7i? (interleu- 
kin 7 receptor), and (leukemia inhibitory factor receptor a). 
The gain identified on 5p also overlaps with TERT (telomerase 
reverse transcriptase), for which altered expression has been 
described in HPV-mediated CC. 22 The 2 regions of CNV loss 
(4q and 1 lq) did not overlap with COSMIC genes, but these 
events have been previously detected in CC. 23 ~ 25 

Gains in CC have been reported for regions encompassing 
IL7R and PIK3CA. A 5p amplification encompassing IL7R was 
previously identified in a primary cervical adenocarcinoma cell 
line (COSMIC ID 687509). We also identified multiple gains on 
3q, which include a region encompassing PIK3CA. The 3q 
amplification is considered a frequent somatic alteration in 
CC 26 ~ 30 Cell line studies provide evidence that suggests that 
amplified PIK3CA, along with increased expression, may acti- 
vate the PBK/Akt pathway, lead to apoptosis inhibition, and 
thereby support tumor cell growth and division. 24 ' 27 PIK3CA 
PvNAseq data are detailed later. The International Cancer 
Genome Consortium 25 has also reported events in SOX2 in CC. 

Exome Sequencing 

Using exome sequencing, we identified 130 nonsynony- 
mous point mutations, splice site point mutations, and small 
indels (insertion deletions; Fig. 1). Of these events, 3 mutations 
affected COSMIC genes including WRN (Werner syndrome, 
RecQ helicase-like) and ASPSCR1 (alveolar soft part sarcoma 
chromosome region, candidate 1 ) and have not been previously 
reported in CC (Table 4). Sorting Tolerant From Intolerant 
(SIFT) 31 and PolyPhen-2 (Polymorphism Phenotyping v2) 32 
were used to predict potential effects of selected point mutations 
on protein function (Table 4). We additionally performed allele 
frequency analysis of all identified mutations to evaluate the 
extent to which exome reads support a mutation (Supplemental 
Digital Content Table SI, available at http://links.lww.com/IGC/A194). 
Ninety-seven of 130 total mutations are supported by all 
exome reads at the mutation location. 

Additional notable nonsynonymous coding mutations that 
were identified (Table 4) include IGF1R (insulin-like growth factor 
1 receptor), SKI (v-ski sarcoma viral oncogene homolog (avian), 
mdRAD50 (RAD50 homolog [S. accharomyces cerevisiae]), and 
XRCC1 (x-ray repair complementing defective repair in Chinese 
hamster cells 1). IGF1R is normally involved in initiating signal- 
ing after binding of IGF 1 (insulin growth factor 1) to activate cell 
proliferation and inhibit apoptosis, 33 and its overexpression has 
been found in both CC cell lines 34 and specimens. 35 ' 36 We, how- 
ever, did not identify significant IGF1R expression changes (cor- 
rected P < 0.05) in the tumor. In addition, a recent study showed 
that resistance to the IGF 1 R inhibitor f igitumamab is associated 
with the absence of N-linked glycosylation at N913. 37 The mu- 
tation causing the change of asparagine to tyrosine at this position 
suggests that the tumor may be resistant to this inhibitor and 
highlights the benefit of performing WGS analyses to gain in- 
sight into therapeutic selection. 

In the proto-oncogene SKI, we identified a coding mu- 
tation that falls in the SMAD4 (mothers against DPP homolog 
4)-binding domain. SKI acts as a corepressor of SMAD proteins 
and blocks the cell's ability to stop cell growth and division. 14 ' 38 
DNA repair genes with missense mutations identified in the 
patient include RAD50 and XRCC1. RAD50 complexes with 
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MRE11 and NBN (nibrin) and identifies and repairs DNA 
damage, 33 whereas XRCC1 is involved in single-stranded break 
repair. InXRCCl, we also identified an SNP (Q399R; rs25487) 
that was reported to be associated with persistent HPV in- 
fection. 34 Although we do not have records of whether the 
sequenced patient experienced HPV persistence, this event 
suggests that the patient may be HPV positive. 

RNA Sequencing 

Commercially purchased normal cervix and normal lung 
RNA samples were prepared and sequenced to serve as the 



control(s) for differential analyses. RNA sequencing metrics are 
shown in Table 1 . Tumor RNA reads were compared against 
normal cervix RNA reads and also compared against both 
normal cervix and normal lung RNA reads. Differential anal- 
ysis of tumor compared with cervix led to the identification of 
3468 genes showing expression changes (corrected/ 3 < 0.05) of 
which 83 genes are listed in COSMIC. Analysis of tumor 
compared with both normal cervix and normal lung led to 
the identification of 2338 genes (corrected P < 0.05), with 5 1 
genes listed in COSMIC. Differentially expressed COSMIC 
genes are listed in Supplementary Table S2, available at 




FIGURE 1. Circos plot summarizing somatic events. A summary of all identified somatic genomic alterations is 
shown. Translocations are marked by purple (interchromosomal) and gray (intrachromosomal translocations) lines; 
for intrachromosomal translocations, the gray connecting line may appear as a single line if the joined regions lie 
within 2000 kb. CNVs are shown along the thick gray ring encircling the translocations (green, regions of 
loss; red, regions of gain; gray, no change); on the ring encircling CNVs, somatic indels (insertion/deletions) are 
marked by light blue tick marks and on the ring encircling the indels, somatic point mutations are marked by dark 
blue tick marks. Because we identified 1 22 nonsynonymous point mutations, splice site point mutations, and 
small indels, selected gene labels associated with point mutations are shown along the outermost area of the plot. 
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TABLE 2. Identified genie intrachromosomal and interchromosomal translocations 



Name 


No. Supporting Reads 


Distance Between 
Breakpoints (bp) 


Gene 1 


Gene 2 
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http://links.lww.com/lGC/A194. In a separate analysis, we 
additionally identified RNA reads that support mutations 
identified through exome sequencing (Supplemental Digital 
Content Table SI). Overall, 35 of 130 mutations identified in 
exome data are supported by 100% of RNA reads at the re- 
spective mutation location. Supplemental data are available 
at http://links.lww.com/lGC/A194. 

Consolidation of both RNAseq analyses led to the 
identification of 36 common differentially expressed COSMIC 
genes. Upon evaluation against the Cervical Cancer Gene 
Database, 9 7 of the 36 genes were found to be previously 
described in CC. These genes include DEK (DEK oncogene), 
FHIT (fragile histidine triad), GATA3 (GATA-binding protein 3), 
and KIT (v-kit Hardy-Zuckerman 4 feline sarcoma viral onco- 
gene homolog). 

For COSMIC genes that fell in CNVs, we evaluated if 
these genes also demonstrated expression changes in the 
tumor (Supplemental Digital Content Table S3, available 
at http://links.lww.com/IGC/A194). Overall, FANCD2 (Fanconi 
anemia, complementation group D2) and SOX2, which both fell 
in regions of copy number gains, showed increased expression in 
the tumor (corrected P < 0.05) in both RNAseq comparisons. 
However, PPARG (peroxisome proliferators-activated receptor 
7) and L1FR, which also fell in areas of gain, demonstrated de- 
creased expression in the tumor. Low PPARG expression 
was also previously reported in CC. 36 Given the P1K3CA am- 
plification that was identified we evaluated P1K3CA, which 
demonstrated an increased expression in the tumor but was not 



initially noted as it did not pass multiple testing corrections. In the 
tumor versus normal cervix analysis, the tumor demonstrated a 
log2 fold of 1 .30 (P = 0.03), whereas in the tumor versus normal 
cervix and lung analysis, the tumor demonstrated a log2 fold of 
1.19 (P = 0.126). Quantitative PCR validation confirmed 
P1K3CA overexpression in the tumor as compared against (3- 
actin (8 Ct = 8.7; Supplemental Digital Content Table S4, 
available at http://links.lww.com/IGC/A194). 

Pathway analysis was performed on each set of differential 
analyses using MetaCore GeneGo (corrected P < 0.05). The top 
10 affected pathways for each analysis are listed in Supple- 
mentary Table S5, available at http://links.lww.com/IGC/A194. 
Overall, pathways that are likely to be the most affected by 
identified expression changes include cell cycle processes, DNA 
methylation, stromal-epithelial interactions, cell adhesion, and 
DNA damage processes. Although these pathways are com- 
monly affected across cancers, this information may lend addi- 
tional contextual insight into CC. 



HPV Detection 

To determine if high-risk HPV integration had occurred 
in the patient, WGS reads were mapped against all 552 human 
viral reference sequences posted on NCBI's (National Center 
for Biotechnology Information) Entrez Genome Viral Genomes 
Resource. These 552 references include 42 HPV genomes, 
including those of HPV- 16 and HPV- 18. Results from align- 
ment against all 42 HPV references are shown in the Supplementary 
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Table S6, available at http://links.lww.com/lGC/A194. Reads 
aligning to HPV genomes were only aligned to HPV- 1 6 or HP V- 1 8 . 
Over 19,000 tumor DNA reads were mapped to the HPV- 18 
genome, whereas no germline reads were mapped (Table 1). 
In the tumor, 49 reads were mapped to HPV- 1 6 , but no germline 
reads were mapped. Evaluation of HPV- 16 reads indicated that 
mapped reads were discordant such that read pairs were mapped 
to HPV-1 8 and the mitochondrial genome. Confidence in these 
reads is low because of low mapping quality and because of 
66% sequence homology between the HPV- 16 and HPV- 18 
genomes. These analyses indicate that HPV- 18 integration 
had occurred in the metastasis. 

HPV-1 8 Integration Analysis 

The HPV- 18 genome consists of 6 early genes (El, E2, 
E4, E5, E6, E7) and 2 late genes (LI, L2). The late genes code 
for viral capsid proteins, and the early genes include viral 
oncogenes (E6 and E7) and code for proteins involved in the 
maintenance of transformation and viral replication. E2 codes 
for a transcriptional repressor that normally inhibits E6 and 
E7 expression. Loss of E2 expression occurs after HPV in- 
tegration into the host genome and allows for E6 and E7 

35 37 38 

expression. ' ' Analysis of whole-genome and RNA data 
indicated that portions of HPV- 18 integrated into multiple 
locations within a nongenic region of chromosome (chr) 6 
(position 4,280,617-4,331,314; 6p25.1), which also overlaps 
with a region of copy number gain and is a common fragile 
site (FRA6B). 14 This finding correlates with a study that 
reported that 63% of HPV-1 8 integrations occur within 
common fragile sites and that significant structural events 
often surround integration sites. 40 In addition, the chr6p25.1 
region that we identified has been reported as an HPV- 16 
integration site in CC 14 but has not been reported for HPV-1 8. 

The evaluation of DNA and RNA reads aligning to 
HPV- 18 indicated that a portion of HPV- 18 DNA in the tumor 
cells remained in the episomal form, a phenotype that has 
been previously reported for HPV-16 in CC. 41 Upon inves- 
tigating the 3' end of the noncoding LCR in HPV-18, we 
found that approximately two thirds of DNA reads were 
mapped back to the E6 region to indicate that these reads were 
generated from episomal HPV-18 DNA. The presence of both 
genomic states of HPV- 1 8 in the sequenced tumor illustrates 
the complexity of HPV integration and emphasizes that both 
spatial and temporal analyses are needed to fully understand 
the role of HPV in CC. 

Visual inspection of DNA and RNA reads suggested 
that multiple integration events occurred. Because of the 
multicellularity of the sequenced sample, this phenotype sug- 
gests that different cells may have undergone different inte- 
gration events. We assembled RNA reads from HPV-18 and 
discordant reads from chr6p25.1 that mapped to HPV-18 to 
pinpoint the major breakage events in expressed transcripts. 
Figure 2 illustrates the mapped HPV-18 RNA reads (A), as- 
sembled contigs generated from the HPV-18 RNA reads (B), 
and a linearized schematic of the HPV- 1 8 genome and the chr6 
region into which HPV-18 integrated (C). Overall, we assem- 
bled 1 55,884 paired reads to generate 20 contigs that ranged in 
size from 220 to 11, 616 base pairs . Contigs that fully or partially 
aligned to HPV-18 are also shown. 



TABLE 3. Identified copy number alterations affecting 
coding regions 
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We identified a contig (3B; red) that spanned HPV-1 8's 
E6 and LCR junction to provide additional evidence of the 
presence of HPV-18 episomes in the sequenced tumor and 
also demonstrated that transcription across the E6/LCR 
junction occurs in HPV-18 episomes. Overall, we identified 3 
major breaks in the HPV-18 genome (Figs. 2B-C). The breaks 
shown in Figure 2C colored purple and pink may represent 
different points (in different cells) where the circular viral ge- 
nome opened during integration as an early event in HPV-18 
integration is the disruption of the E2 gene to allow for 
genome linearization. A caveat with these analyses is that the 
contigs are a representation of the most commonly occurring 
events. Additional events that are less common in the cells 
of the sequenced biopsy may not be captured using this 
approach. 

Transcriptomic Analysis of HPV-18 Gene 
Expression 

We additionally used the tumor HPV- 1 8 reads to evaluate 
expression of HPV-18 genes. Because of the absence of a 
control dataset for the HPV-18 genes, we evaluated the number 
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TABLE 4. Selected SNVs identified through exome sequencing 
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of reads that were acquired for each gene to determine the level 
of expression (Supplemental Digital Content Table S7, avail- 
able at http://links.lww.com/IGC/A194). We identified 4103 
reads mapping to E2, but no expression of integrated E2 re- 
gions, as is expected because integration disrupts E2 to sub- 
sequently allow for increased expression of E6 and E7, which 
we also identified in the patient's tumor. We also identified 
expression of E5 — loss of E5 expression is associated with 
HPV integration 42 and thus correlates with the presence of 
HPV-18 episomes in the biopsied tumor cells. In conjunction 
with high levels of E6 and E7 expression (3774 and 2922 reads, 
respectively) and integration of both genes into chr6, we see 
over 550X mapped reads in the chromosome 6 region affected 
by HPV-18 integration. This phenotype is likely the result of 
viral promotion of E6 and E7 expression, which subsequently 
promotes expression of the intronic chromosome 6 regions 
situated at the 3' end of the integrated viral oncogenes. 

After HPV integration, HPV proteins acquire control of 
host cellular pathways. We thus evaluated expression changes 
of genes whose products are affected by HPV proteins. These 
genes include TP53 (tumor protein p53), WAF1/CDKN1A 
(p21; cycle-dependent kinase inhibitor 1A), RBI (retino- 
blastoma 1), CCNE2 (cyclin E2), E2F1 (E2F transcription 
factor 1), CDKN2A (cyclin-dependent kinase inhibitor 2A), 
BRD4 (bromodomain containing 4), and CDC25A (cell di- 
vision cycle 25A; Supplemental Digital Content Table S8, 
available at http://links.lww.com/IGC/A194). These genes 
have roles in 2 key tumor suppressor pathways affected by 
HPV-18's E6 and E7 genes. Overall, we identified increased 
expression of TP53, CCNE2, E2F1, CDKN2A, and CDC25A 
(corrected P< 0.05). HPV-18's E6 protein targets and inhibits 



p53 such that the up-regulated TP 5 3 expression identified in 
the tumor may represent a compensatory response. Further- 
more, p53 is activated by pi 4/CDKNA2 A such that up-regulated 
expression of CDKN2A may also represent a response to in- 
hibition of p53's functions. HPV-18's E7 protein also degrades 
RB 1 and inhibits the RB 1 tumor suppressor pathway causing 
abnormal gene expression normally controlled by E2F, CDK2 
complex activation, and increasing CDC25 protein. The sig- 
nificantly up-regulated expression of CCNE2, E2F1, and 
CDC25A that we identified thus correlate with RB 1 pathway 
inhibition that would result from E7 expression. 

HPV-18 Integration Validation 

To validate the presence of the 4 identified junctions 
resulting from HPV-18 integration and the presence of epi- 
somal HPV-18 DNA, we performed PCR of regions spanning 
the 4 junctions on cDNA generated from tumor RNA. These 
junctions include HPV-18 LCR-E6 (found in episomal forms 
of HPV-18), El-chr6:4,291,973, E2-chr6:4,282,640, and E2/ 
E4-chr6:4,328,779. PCR products were Sanger sequenced, 
and resulting chromatograms confirmed the presence of the 4 
junctions (Supplemental Digital Content Figure S2, available 
at http://links.lww.com/IGC/A194). 

CONCLUSIONS 

In this study, we used NGS to characterize somatic 
alterations in a patient with CC as well as to identify and ana- 
lyze HPV integration. Although combined DNA and RNA 
sequencing has been performed for other cancers, this study is 
the first to apply this approach to a patient with CC and to 
concurrently analyze HPV integration in the same sequencing 
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4,282,640 4,291,973 4,328,779 

FIGURE 2. HPV-1 8 integration analysis. This schematic summarizes the major integration events that were 
detected using RNA data. Integrated Genomics Viewer was used to visualize RNA reads mapping to HPV-1 8 (A). 
A coverage track at the top of (A) illustrates HPV-1 8 expression levels. Orange and dark red coloring mark discordant 
reads that fall at breakpoints, whereas gray reads mapped without discordance. B, The figure shows the 
contigs assembled from the reads such that peach-colored portions of the contigs correspond to chromosome 
6 sequences and blue-colored portions of the contigs correspond to HPV-1 8 sequences. The red contig spans the 
LCR and E6 regions of HPV-1 8 to indicate the presence of episomal DNA. The locations of identified breakpoints are 
shown at the top of panel B with green, purple, and pink arrows, and lineup with breakpoints identified in panel A. 
C, The figure shows a linearized view of the HPV-1 8 genome (blue) and the chromosome 6 region (peach) 
where HPV-1 8 reads were aligned. As shown in panel B and C, 3 separate events were identified (demarcated 
by green, purple, and pink arrows in panel B and corresponding green, purple, and pink connecting lines in panel C). 



data. Before sequencing, the patient's HPV status was not 
known but using NGS, we determined that HPV- 18 integration 
had occurred in the patient's tumor and subsequently identified 
a novel integration site on chr6p25.1. We also found that al- 
though some tumor cells in the biopsy garnered episomal HPV- 
18, other cells had undergone HPV- 18 integration. Although 
this phenotype has been reported for HPV- 1 6 in CC, this finding 
reiterates the need to consider both spatial and temporal anal- 
yses to fully understand the role of HPV in CC. The approach 
used in this study is relevant not only to HPV-associated car- 
cinomas but also to other malignancies involving viral infec- 
tion. Furthermore, using NGS to simultaneously characterize a 
tumor genome and to analyze HPV integration is relevant for 
CC because HPV integration is not the sole driving event in CC 
as additional debilitating events are required for carcinogene- 
sis. 43 As we continue to sequence CC tumors, we set the 
foundation for improving our molecular knowledge of HPV, 
CC, and HPV in CC. 
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